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Abstract 

Camera  networks  are  widely  used  for  tasks  such 
as  surveillance,  monitoring  and  tracking.  In  or¬ 
der  to  accomplish  these  tasks,  knowledge  of  lo¬ 
calization  information  such  as  camera  locations 
and  other  geometric  constraints  about  the  envi¬ 
ronment  (e.g.  walls,  rooms,  and  building  layout) 
are  typically  considered  to  be  essential.  How¬ 
ever,  this  information  is  not  always  required  for 
many  tasks  such  as  estimating  the  topology  of 
camera  network  coverage,  or  coordinate- free  ob¬ 
ject  tracking  and  navigation.  In  this  paper, 
we  propose  a  simplicial  representation  (called 
CA^-Complex)  that  can  be  constructed  from  dis¬ 
crete  local  observations  from  cameras,  and  uti¬ 
lize  this  novel  representation  to  recover  the  topo¬ 
logical  information  of  the  network  coverage.  We 
prove  that  our  representation  captures  the  cor¬ 
rect  topological  information  from  network  cov¬ 
erage  for  2.5D  layouts,  and  demonstrate  their 
utility  in  simulations  as  well  as  a  real-world 
experimental  set-up.  Our  proposed  approach 
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is  particularly  useful  in  the  context  of  ad-hoc 
camera  networks  in  indoor/outdoor  urban  envi¬ 
ronments  with  distributed  but  limited  computa¬ 
tional  power  and  energy. 

1  Introduction 

Future  generations  of  sensor  networks  are  in¬ 
variably  going  to  include  multiple  types  of  sen¬ 
sors  -  including  spatial  sampling  sensors  such 
as  cameras  or  active  range  scanners.  Sensors 
like  cameras  will  be  the  dominant  consumers 
of  bandwidth  and  power  in  such  heterogenous 
sensor  networks.  Thus,  a  clear  understanding 
of  the  constraints  (such  as  bandwidth  consump¬ 
tion,  power  consumption,  spatio-temporal  sam¬ 
pling)  posed  by  camera  sensors  in  the  context 
of  computation  and  communication  will  play  a 
critical  role  in  defining  the  bounds  for  feasibil¬ 
ity  of  performing  certain  tasks  in  a  heterogenous 
sensor  network.  In  other  words,  such  an  under¬ 
standing  in  the  context  of  cameras  could  tell  us 
whether  our  design  of  the  heterogenous  network 
will  be  able  to  perform  the  designated  task  or 
not  -  and  what  conditions  are  necessary  in  order 
to  perform  such  tasks. 

Identification  of  the  exact  location  of  targets 
and  objects  in  an  environment  is  essential  for 
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many  surveillance  applications  in  the  realm  of 
sensor  networks.  However,  there  are  situations 
in  which  the  localization  of  the  sensors  is  not 
known  (e.g.  unavailability  of  GPS,  or  ad-hoc 
network  setup).  A  common  approach  to  over¬ 
coming  this  challenge  has  been  to  determine  the 
exact  localization  of  the  sensors  and  reconstruc¬ 
tion  of  the  surrounding  environment.  Neverthe¬ 
less,  we  will  provide  evidence  supporting  the  hy¬ 
pothesis  that  many  of  the  tasks  at  hand  may  not 
require  exact  localization  information.  For  in¬ 
stance,  when  tracking  individuals  in  an  airport, 
we  may  want  to  know  whether  they  are  in  the 
vicinity  of  a  specific  gate.  In  this  scenario,  it  is 
not  absolutely  necessary  to  know  their  exact  lo¬ 
cation.  Another  example  is  navigation  through 
an  urban  environment.  This  task  can  be  accom¬ 
plished  by  making  use  of  target  localization  and 
a  set  of  directions  such  as  where  to  turn  right, 
and  when  to  keep  going  straight.  In  both  situa¬ 
tions,  a  general  description  of  our  surroundings 
and  the  target  location  is  sufficient.  The  type 
of  information  that  we  desire  is  a  topological  de¬ 
scription  of  the  environment  that  captures  the 
appropriate  structure  of  the  environment. 

One  of  the  fundamental  questions  in  the  con¬ 
text  of  camera  networks  is  whether  a  network  is 
limited  to  perform  only  tasks  that  a  single  cam¬ 
era  can  perform  but  at  a  larger  scale,  or  if  the 
total  network  is  “greater”  than  the  sum  of  the 
parts.  Imagine  a  camera  network  where  no  inter¬ 
relationship  between  the  cameras  is  known.  It 
is  natural  to  ask  what  the  spatial  relationship 
between  cameras  is.  For  an  application  such  as 
surveillance  in  which  multiple  views  are  certainly 
useful,  we  investigate  how  object  tracking  infor¬ 
mation  from  multiple  cameras  can  be  aggregated 
and  analyzed.  A  related  and  important  ques¬ 
tion  here  is  as  to  how  we  manage  the  processing 
and  flow  of  data  between  the  cameras.  We  note 


that  all  of  these  questions  can  be  approached  us¬ 
ing  knowledge  of  the  topology  of  the  coverage  of 
the  network.  In  particular,  topology  awareness 
makes  it  possible  to  design  more  efficient  routing 
and  broadcasting  schemes  as  it  is  discussed  by  M. 
Li  et  al  [9].  This  knowledge  in  turn  can  also  aid 
with  control  mechanism  for  more  energy-efficient 
usage. 

Figure  1  serves  as  a  didactic  tool  to  under¬ 
stand  the  information  required  for  our  approach 
to  coordinate  free  tracking  and  navigation  prob¬ 
lems.  Observe  that  the  complete  floor  plan 
(left)  and  corresponding  abstract  representation 
(right)  serves  an  equivalent  purpose.  The  ab¬ 
stract  representation  allows  us  to  track  a  target 
and  navigate  through  the  environment.  Our  goal 
in  this  context  is  to  use  the  continuous  observa¬ 
tions  from  camera  nodes  to  extract  the  necessary 
symbols  to  create  this  representation. 
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Figure  1:  A  physical  (left)  and  an  abstract 
(right)  layout  of  an  environment  are  compared. 
In  both  cases  we  observe  a  target  and  the  corre¬ 
sponding  path  for  its  motion. 


In  this  paper,  we  consider  a  camera  network 
where  each  camera  node  can  perform  local  com¬ 
putations,  extract  some  symbolic/discrete  obser¬ 
vations  to  be  transmitted  for  further  processing. 
This  conversion  to  symbolic  representation  alle¬ 
viates  the  communication  overhead  for  a  wire¬ 
less  network.  We  then  use  these  discrete  ob¬ 
servations  to  build  a  model  of  the  environment 
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without  any  prior  localization  information  of  ob¬ 
jects  or  the  cameras  themselves.  Once  such  non¬ 
metric  reconstruction  of  the  camera  network  is 
accomplished,  this  representation  can  be  used  for 
tasks  such  as  coordinate-free  navigation,  target¬ 
tracking,  and  path  identification. 

The  rest  of  the  discussion  is  as  follows.  We 
first  start  with  a  brief  and  informal  discussion 
about  different  approaches  to  capturing  topolog¬ 
ical  information  in  sensor  networks  and  discuss 
the  related  work  in  this  domain.  We  then  in¬ 
troduce  the  algebraic  topological  tools  used  for 
constructing  our  model.  Next,  we  discuss  how 
the  topological  recovery  (or  non-metric  recon¬ 
struction)  of  the  camera  network  can  be  done 
in  2.5D  along  with  simulation  and  experiments. 
Appendix  A  provides  a  brief  introduction  to  the 
algebraic  topological  tools  and  terminology  used 
for  this  work. 

2  Related  Work 

Finding  the  topology  of  a  domain  embedded  in 
is  closely  related  to  detecting  holes.  There 
has  been  much  work  on  the  detection  and  re¬ 
covery  of  holes  by  topological  methods  for  sen¬ 
sor  networks,  most  of  which  consider  symmetric 
coverage  (explicitly  or  implicitly)  or  high  enough 
density  of  sensors  in  the  field.  In  particular,  Vin 
de  Silva  and  Christ  [6]  obtain  the  Rips  complex 
based  on  the  communication  graph  of  the  net¬ 
work  and  compute  homologies  using  this  repre¬ 
sentation.  These  methods  assume  some  symme¬ 
try  in  the  coverage  of  each  sensor  node  (such 
as  circular  coverage),  however,  such  assumptions 
are  not  valid  for  camera  networks.  Spatial  sam¬ 
pling  of  plenoptic  function  [2]  from  a  network  of 
cameras  is  rarely  i.i.d.  (independent  and  identi¬ 
cally  distributed).  The  notion  of  spatial  coher¬ 


ence  encountered  in  the  context  of  camera  net¬ 
works  is  not  handled  in  traditional  sensor  net¬ 
work  literature. 

Connectivity  between  overlapping  camera 
views  by  determining  the  correspondence  mod¬ 
els  between  cameras  and  extracting  homogra- 
phy  models  has  been  approached  by  Stauffer 
and  Tieu  [15].  Cheng  et  al  [5]  build  a  vi¬ 
sion  graph  in  a  distributed  manner  by  exchang¬ 
ing  feature  descriptors  from  each  camera  view. 
In  their  work,  each  camera  encodes  a  spa¬ 
tially  well-distributed  set  of  distinctive,  approx¬ 
imately  viewpoint- invariant  feature  points  into 
a  fixed-length  “feature  digest”  that  is  broadcast 
throughout  the  network  to  establish  correspon¬ 
dence  between  cameras.  Yeo  et  al  [16]  utilize  a 
distributed  source  coding  framework  to  exchange 
compact  feature  descriptors  in  a  rate-efficient 
manner  to  establish  correspondence  between  var¬ 
ious  camera  views. 

Marinakis  et  al  [11]  work  on  finding  connec¬ 
tivity  between  non-overlapping  coverage  of  cam¬ 
eras  by  using  only  reports  of  detection  and  no 
description  of  the  target.  They  use  a  Markov 
model  for  modeling  the  transition  probabilities 
and  minimize  a  functional  using  Markov  Chain 
Monte  Carlo  Sampling.  They  also  present  a 
different  formulation  of  the  same  problem  with 
“timestamp  free”  observation  with  only  order¬ 
ing  available  (still  no  target  description)  [12]. 
Other  approaches  to  solving  the  same  problem 
with  target  identification  have  been  explored  by 
Zou  et  al  [17].  Camera  network  with  overlaps 
have  been  studied  using  the  statistical  consis¬ 
tency  of  the  observation  data  by  Makris  et  al[10]. 
Rahimi  et  al  [14]  describe  a  simultaneous  cali¬ 
bration  and  tracking  algorithm  (with  a  networks 
of  non-overlapping  sensors)  by  using  velocity  ex¬ 
trapolation  for  a  single  target. 
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3  The  Environment  Model 

3.1  The  Problem  in  2.5D 

Our  problem  will  be  defined  in  terms  of  the  de¬ 
tection  of  a  target  moving  through  an  environ¬ 
ment.  For  the  sake  of  mathematical  clarity,  we 
first  focus  on  the  case  of  a  single  target  mov¬ 
ing  through  the  environment.  Let  us  start  by 
describing  our  setup: 

The  Environment  in  2.5D  :  We  consider  a 
domain  in  3D  with  the  following  constraints: 

•  All  objects  and  cameras  in  the  environ¬ 
ment  will  be  within  the  space  defined  by  the 
planes  z  =  0  (the  “floor” )  and  z  =  hmax  (f he 
“ceiling”). 

•  Objects  in  the  environment  consists  of  static 
“walls”  erected  perpendicular  to  our  plane 
from  z  =  0  to  z  =  hmax-  The  perpendicular 
projection  of  the  objects  to  the  plane  z  =  0 
must  have  a  piecewise  linear  boundary.  Ob¬ 
jects  must  enclose  a  non-zero  volume. 

Cameras  in  2.5D  :  A  camera  a  has  the  fol¬ 
lowing  properties: 

•  It  is  located  at  position  with  an  arbi¬ 
trary  3D  orientation  and  a  local  coordinate 
frame 

•  Its  camera  projection  in  3D  ,  : 

^  is  given  by 

=  iPxlPz,PylPz), 

where  p  is  given  in  coordinate  frame 
and  Toi  C  ({(a:,  y^z)\z  >  0}),  referred  to  as 
the  field  of  view  (FOV)  of  the  camera,  is 
an  open  convex  set  such  that  its  closure  is 
a  convex  cone  based  at  The  image  of 

this  mapping,  i.e.  will  be  called 

the  image  domain 


The  Target  in  2.5D  :  A  target  will  have  the 
following  properties: 

•  The  target  will  be  a  line  segment  perpen¬ 
dicular  to  the  bounding  planes  of  our  do¬ 
main  which  connects  the  points  (x,^,0)  to 
(x,^, /it),  where  x  and  y  are  arbitrary  and 
ht  <  hmax  is  the  height  of  the  target.  The 
target  is  free  to  move  along  the  domain  as 
long  as  it  does  not  intersect  any  of  the  ob¬ 
jects  in  the  environment. 

•  A  target  is  said  to  be  detected  by  camera 

a  if  there  exists  a  point  p  :=  {x^y^z)  in 

the  target  such  that  p  G  Tol  and  p  does 
not  interscect  any  of  the  objects  in  the  en¬ 
vironment. 

Note  that  these  assumptions  may  seem  very 
restrictive,  but  they  are  satisfied  by  most  camera 
networks  in  indoor  and  outdoor  environments. 
Also,  some  of  these  choices  in  our  model  (such 
as  the  vertical  line  target  and  polygonal  objects) 
are  made  in  order  to  simplify  our  analysis.  We 
will  see  that  our  methods  work  in  real-life  sce¬ 
narios  through  our  experiments. 

The  example  in  figure  2  shows  a  target  and  a 
camera  with  its  corresponding  FOV. 

Problem  1  (2.5D  Case):  Given  the  eamera 
and  environment  models  in  2.5D  ,  our  goal  is  to 
obtain  a  representation  that  eaptures  the  topolog- 
ieal  strueture  of  the  detectable  set  for  a  cam¬ 
era  network  (i.e.,  the  union  of  the  sets  in  whieh 
a  target  is  deteetable  by  a  eamera).  The  eon- 
struetion  of  this  representation  should  not  rely 
on  eamera  or  objeet  loealization. 

The  formulation  of  the  problem  is  very 
generic.  We  are  choosing  a  simplicial  represen¬ 
tation  because  we  are  after  a  combinatorial  rep- 
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Figure  2:  Mapping  from  2.5D  to  2D  :  A  cam¬ 
era  and  its  field  of  view  (FOV)  are  shown  from 
multiple  perspectives  (left  and  middle).  The  cor¬ 
responding  mapping  of  this  configuration  to  2D 
is  shown  on  the  right.  For  the  2.5D  configura¬ 
tion,  the  planes  displayed  bound  the  space  that 
can  be  occupied  by  the  target. 

resentation  that  does  not  contain  metric  infor¬ 
mation.  We  are  also  after  a  distributed  solution, 
i.e.  processing  information  at  local  nodes. 

3.2  Mapping  from  2.5D  to  2D 

The  structure  of  the  detectable  set  for  a  camera 
network  becomes  clear  through  an  identification 
of  our  2.5D  problem  to  a  2D  problem.  Since 
the  target  is  constrained  to  move  along  the  floor 
plane,  it  is  possible  to  map  our  problem  to  a  2D 
problem.  In  particular: 

•  Cameras  located  at  locations  {x^y^z)  are 
mapped  to  location  {x^y)  in  the  plane. 

•  Objects  in  our  2.5D  domain  are  mapped  to 
objects  with  piecewise  linear  boundaries  in 
the  plane. 

•  We  can  also  do  a  simple  identification  be¬ 
tween  the  FOV  of  a  camera  to  a  domain 

of  a  camera  in  2D  .  A  point  (x,  y)  in  the 
plane  is  in  if  the  target  located  at  that 
point  intersects  the  FOV  To.-  The  set 
is  the  orthogonal  projection  (onto  the  xy- 
plane)  of  the  intersection  between  and 


the  space  between  z  >  0  and  z  <  htarget- 
Since  the  latter  is  an  intersection  of  con¬ 
vex  sets,  and  orthogonal  projections  pre¬ 
serve  convexity,  then  Vc^  is  convex.  We  can 
also  check  that  will  be  open. 

•  Also,  we  can  give  a  2D  description  of  the 
coverage  of  a  camera.  A  point  (x,  y)  is  in 
the  coverage  Ca  of  camera  a  if  the  target 
located  at  (x,  y)  is  detectable  by  the  camera. 

3.3  The  Problem  in  2D 

We  now  proceed  by  characterizing  our  problem 
after  mapping  the  original  configuration  from  a 
2.5D  space  to  2D  .  The  following  definitions  are 
presented  to  formalize  our  discussion. 

The  Environment:  The  space  under  consid¬ 
eration  is  similar  to  the  one  depicted  in  figure 
1  (left),  where  cameras  are  located  in  the  plane, 
and  only  sets  with  piecewise-linear  boundaries 
are  allowed  (including  object  and  paths).  We  as¬ 
sume  a  finite  number  of  objects  in  our  environ¬ 
ment. 

Cameras:  A  camera  object  a  is  specified  by: 
its  position  Oa  in  the  plane;  and  an  open  convex 
domain  Pq,,  referred  to  as  the  camera  domain. 

The  camera  domain  Pq.  can  be  interpreted  as 
the  set  of  points  visible  from  camera  a  when  no 
objects  occluding  the  field  of  view  are  present. 
The  convexity  of  this  set  will  be  essential  for 
some  of  the  proofs.  Some  examples  of  camera 
domains  are  shown  in  figure  4. 

Definition  1  The  subset  of  the  plane  oeeupied 
by  the  i-th  object,  whieh  is  denoted  by  Oi,  is  a 
eonneeted  elosed  subset  of  the  plane  with  non¬ 
empty  interior  and  pieeewise  linear  boundary. 
The  eolleetion  where  Nq  <  oo  is  the 
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number  of  objects  in  the  environment,  will  be  re¬ 
ferred  to  as  the  objects  in  the  environment. 

Definition  2  Given  a  camera  a,  a  point  p  G 
is  said  to  be  visible  from  camera  a  if  p  ^ 
Va  and  odjp  H  where  ojjp  is  the 

line  between  the  camera  location  o^  and  p.  The 
set  of  visible  points  is  called  the  coverage  Ca  of 
camera  a. 

We  consider  the  following  problem: 

Problem  2  (2D  Case):  Given  the  camera  and 
environment  models  in  2D  ,  our  goal  is  to  ob¬ 
tain  a  simplicial  representation  that  captures  the 
topological  structure  of  the  coverage  of  the 
camera  network  (i.e.,  the  union  of  the  cov¬ 
erage  of  the  cameras).  The  construction  of  this 
representation  should  not  rely  on  camera  or  ob¬ 
ject  localization. 

Observation  1  Note  that  the  camera  network 
coverage  has  the  same  homology  (i.e.  topological 
information)  as  the  domain  (M^  —  yjOi)  if  these 
two  sets  are  homotopic  (i.e.,  we  can  continuously 
deform  one  into  the  other). 

4  The  CN-Complex 

Our  goal  is  the  construction  of  a  simplicial  com¬ 
plex  that  will  capture  the  homology  of  the  union 
of  camera  coverage  IjCc^.  One  possible  approach 
for  accomplishing  this  task  is  to  obtain  the  nerve 
complex  (see  appendix  A)  using  the  set  of  cam¬ 
era  coverage  {Co}-  However,  this  approach  will 
only  work  for  simple  configurations  without  ob¬ 
jects  in  the  domain.  An  example  illustrating  our 
claim  is  shown  in  figure  3. 

The  reason  figure  3  (right)  does  not  capture 
the  topological  structure  of  the  coverage  is  be¬ 
cause  the  hypothesis  of  the  Cech  Theorem  (see 


Figure  3:  Examples  illustrating  nerve  complexes 
obtained  using  the  collection  of  camera  coverage 
{C(y}>  One  complex  captures  the  correct  topo¬ 
logical  information  (left)  but  the  other  does  not 
(right). 

appendix  A)  is  not  satisfied  (in  particular,  CinC2 
is  not  contractible).  From  the  physical  layout  of 
the  cameras  and  the  objects  in  the  environment, 
it  is  clear  how  we  can  divide  C\  in  order  to  obtain 
contractible  intersections.  We  are  after  a  a  de¬ 
composition  of  the  coverage  that  can  be  achieved 
without  knowing  the  exact  location  of  objects  in 
the  environment. 

4.1  The  Decomposition  Theorem 

Before  we  proceed  let  us  consider  the  following 
useful  definitions: 

Definition  3  Given  the  objects  {Oi}f^^,  a 
piecewise  linear  path  F  :  [0, 1]  ^  is  said  to 
be  feasible  i/F([0, 1])  H  {[jOi)  =  0. 

Definition  4  Given  camera  a  with  camera  do¬ 
main  Va  and  corresponding  boundary  dVa,  a 
line  La  is  a  bisecting  line  for  the  camera  if: 

•  La  goes  through  the  camera  location  Oa- 

•  There  exists  a  feasible  path  F  :  [0, 1]  ^ 
such  that  for  any  e  >  0  there  exists  a  6  such 
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that  0  <C  (5  <C  6^  r(0.5  —  ^  r(0.5  “h  (5)  ^ 

Cc,;  r(0.5)  G  Lo,,  and  r(0.5)  ^  dV^,. 

If  we  imagine  a  target  traveling  through  the 
path  r,  we  note  that  the  last  condition  in  the  def¬ 
inition  of  a  bisecting  line  identifies  when  an  oc¬ 
clusion  event  is  detected  (i.e.,  the  target  tran¬ 
sitions  from  visible  to  not  visible,  or  viceversa). 
However,  we  will  ignore  the  occlusion  events  due 
to  the  target  leaving  through  the  boundary  of 
the  camera  domain  Pq,. 


individual  camera.  This  construct  will  capture 
the  correct  topological  structure  of  the  coverage 
of  the  network. 

Figure  5  displays  examples  of  (7A^-complexes 
obtained  after  decomposing  the  coverage  of  each 
camera  using  their  corresponding  bisecting  lines. 
The  CA^-complex  captures  the  correct  topolog¬ 
ical  information,  given  that  we  satisfy  the  as¬ 
sumptions  made  for  the  model  described  in  sec¬ 
tion  3.  The  following  theorem,  which  proof  can 
be  found  in  appendix  B,  states  this  fact. 


Definition  5  Let  {La^i}^i  be  a  finite  eolleetion 
of  biseeting  lines  for  eamera  a.  Consider  the  set 
of  adjaeent  eones  in  the  plane  bounded 

by  these  lines,  where  Nq  —  2^^,  then  the  de¬ 
composition  of  Ca  by  lines  {La^i}  is  the  eol¬ 
leetion  of  sets 


Note  that  the  decomposition  of  Ca  is  not  a 
partition  since  the  sets  Caj  are  not  necessarily 
disjoint. 


Figure  4:  Three  examples  of  camera  domains 
Va>  Note  that  cameras  can  be  inside  or  out¬ 
side  these  sets.  Our  camera  model  spans  projec¬ 
tion  models  from  perspective  earners  to  omni¬ 
directional  cameras.  Examples  of  decomposi¬ 
tions  are  shown  for  each  set  Ca- 
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Figure  5:  Examples  of  (7A^-complexes.  On  the 
left,  camera  1  is  decomposed  into  three  regions, 
each  of  which  becomes  a  different  vertex  in  our 
complex.  On  the  right,  cameras  1  and  2  are  both 
decomposed  into  three  regions. 


Theorem  1  (Decomposition  Theorem) 

Let  {Ca}a=i  be  a  eolleetion  of  eamera  eoverage 
where  eaeh  Ca  is  eonneeted  and  N  is  the  number 
of  eameras  in  the  domain.  Let  {Ca^k}{a,k)eAD 
the  eolleetion  of  deeomposed  sets  by  all  possible 
biseeting  lines,  where  Ajj  is  the  set  of  indiees  in 
the  deeomposition.  Then,  any  finite  interseetion 
n(a'  k')eA^oL'  ,kA  where  A  is  a  finite  set  of  indiees, 
is  eontraetible. 

Hence,  the  hypothesis  of  the  Cech  Theorem 
is  satisfied  if  we  have  connected  coverage  which 
are  decomposed  by  all  of  their  bisecting  lines. 
This  implies  that  computing  the  homology  of  the 


The  construction  of  the  camera  network 
complex  (CA^-complex)  is  based  on  the  identi¬ 
fication  of  bisecting  lines  for  the  coverage  of  each 
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CA^-complex  returns  the  appropriate  topological 
information  about  the  network  coverage. 

Observation  2  Note  that  there  are  many  ways 
to  deeompose  a  set  in  order  to  obtain  subsets  with 
eontraetible  inter seetions.  However,  by  using  the 
biseeting  lines,  we  ensure  that  the  deeomposition 
ean  be  done  loeally  (at  eaeh  eamera  node)  without 
knowledge  of  the  physieal  strueture  of  the  envi¬ 
ronment. 

We  note  that  the  steps  required  to  build  the 
CWcomplex  are  two-fold: 

•  Identify  all  bisecting  lines  and  decompose 
each  camera  coverage. 

•  Determine  which  of  the  resulting  sets  inter¬ 
sect. 

The  first  step  makes  sure  that  any  intersection 
will  be  contractible.  The  second  step  allows  us  to 
find  the  simplices  for  our  representation.  These 
two  steps  can  be  completed  in  different  ways 
which  depend  on  the  scenario  under  consider¬ 
ation.  In  the  section  5  and  6,  we  illustrate  the 
construction  of  the  CWcomplex  for  a  very  spe¬ 
cific  scenario. 

4.2  From  2D  to  2.5D 

We  can  build  the  CiV-complex  by  decomposing 
each  camera  coverage  using  its  bisecting  lines 
and  determining  which  of  the  resulting  sets  in¬ 
tersect.  However,  a  physical  camera  only  has 
access  to  observations  available  in  its  image  do¬ 
main  Therefore,  it  is  essential  to  determine 
how  to  find  bisecting  lines  using  information  in 
the  image  domain. 

We  note  that  occlusion  events  occur  when  the 
target  leaves  the  coverage  Ca  of  camera  a  along 
the  boundary  of  the  camera  domain  or  along 


a  bisecting  line.  We  can  verify  that  a  target  leav¬ 
ing  through  the  boundary  of  Vc^  will  be  detected 
in  the  image  domain  as  having  the  target 
disappering/appearing  through  the  boundary  of 
If  the  target  leaves  Ca  through  one  of  the 
bisecting  lines,  we  will  observe  an  occlusion  event 
in  the  interior  of  Note  that  bisecting  lines 

in  the  2D  domain  correspond  to  vertical  planes  in 
the  2.5D  configuration,  whose  intersection  with 
the  FOV  of  the  camera  map  to  lines  in 
Hence,  all  that  is  required  is  to  find  the  line  seg¬ 
ment  in  which  an  occlusion  event  takes  place  in 
the  image  domain.  From  an  engineering  point  of 
view,  this  can  be  done  by  performing  some  sim¬ 
ple  image  processing  to  find  the  edge  along  which 
target  disappears/appears  in  an  image.  The  re¬ 
sult  will  be  a  decomposition  of  the  image  domain 
which  will  correspond  to  a  decomposition 
of  the  camera  coverage  Ca-  We  also  emphasize 
that  these  computations  can  be  done  locally  at 
a  camera  node  without  any  need  to  transmit  in¬ 
formation. 

The  problem  of  finding  intersections  of  the 
sets  for  the  2D  problem  corresponds  to  having 
concurrent  detections  at  corresponding  cameras 
for  the  case  of  a  single  target  in  the  environ¬ 
ment.  Finding  overlap  between  these  regions 
can  be  solved  for  the  multiple-target  case  by 
using  approaches  such  as  the  ones  outlined  in 
[11,  12,  17,  16,  5]  in  which  correspondence  and 
time  correlation  are  exploited. 

5  Simulations  in  2D 

We  consider  a  scenario  similar  to  the  one  shown 
in  figure  1  (left)  in  which  a  wireless  camera  net¬ 
work  is  deployed  and  no  localization  informa¬ 
tion  is  available.  Each  camera  node  will  be  as¬ 
sumed  to  have  certain  computational  capabilities 


and  they  can  communicate  wirelessly  with  each 
other. 

The  assumptions  for  this  particular  simulation 
are: 

The  Environemt  in  Simulation:  The  ob¬ 
jects  in  the  environment  will  have  piecewise  lin¬ 
ear  boundaries  as  described  earlier.  The  location 
of  the  objects  will  be  unknown.  The  location  and 
orientation  of  the  cameras  is  also  unkown. 

Cameras  in  Simulation:  A  camera  a  has  the 
following  properties: 

•  The  domain  of  a  camera  in  2D  will  be 
the  interior  of  a  convex  cone  with  field  of 
view  9oc  <  180^.  We  use  this  model  for  sim¬ 
plicity  in  our  simulations. 

•  A  local  camera  frame  is  chosen  such 
that  the  range  of  the  field  of  view  is 
[— 0c^/2,  0Q./2]  when  measured  from  the  y- 
axis. 

•  Its  camera  projection  :  Pq.  ^  M,  is 

given  by 

{p)  =  Px/Py, 

where  p  is  given  in  coordinate  frame 
The  image  of  this  mapping,  i.e. 
will  be  called  the  image  domain 

The  Target  in  Simulation:  A  single  point 
target  is  considered  in  order  to  focus  on  the  con¬ 
struction  of  the  complex  without  worrying  about 
correspondence/identification  of  our  target. 

Throuhgout  our  simulations  we  will  have  the 
target  moving  continuously  through  the  environ¬ 
ment.  At  each  time  step  the  cameras  compute 
their  detections  of  the  target  and  use  their  obser¬ 
vations  to  detected  bisecting  lines.  Observations 


at  the  regions  obtained  after  decomposition  us¬ 
ing  the  bisecting  lines  are  stored.  These  obser¬ 
vations  are  then  combined  to  determine  inter¬ 
sections  between  the  regions  which  become  sim- 
plices  in  the  CWcomplex. 

As  mentioned  before,  the  topology  of  the  en¬ 
vironment  can  be  characterized  in  terms  of  its 
homology.  In  particular  we  will  use  betti  num¬ 
bers  /3o  and  /?i  (see  appendix  A) .  The  /3o  number 
tells  us  the  number  of  connected  components  in 
the  coverage  while  jSi  gives  the  number  of  holes. 
The  PLEX  software  package  [1]  is  used  for  com¬ 
puting  the  homologies  and  corresponding  betti 
numbers. 


Figure  6:  On  the  left,  a  three-camera  layout 
with  two  occluding  objects,  where  C3  is  shown. 
On  the  right,  a  four-camera  layout  in  a  circular 
hallway-type  configuration,  where  Ci  is  shown. 
Dashed  lines  represent  corresponding  bisecting 
lines.  Dotted  curves  are  the  paths  followed  by 
the  target. 

Figure  6  (left)  is  a  three-camera  layout  with 
two  objects  in  their  field  of  view.  In  this  case,  we 
observe  three  bisecting  lines  for  camera  1,  two  for 
camera  2,  and  four  for  camera  3.  The  coverage 
C3  is  decomposed  into  5  regions,  namely  {Cs^a^ 
^3,6,  C?>4  2tnd  C3^e}-  The  list  of  maximal  sim- 

plices  obtained  by  our  algorithm  is:  [la  16  Ic  Id], 
[2a  26  2c],  [3a  36  3c  3d  3e],  [la  16  2c  3c], 
[Id  2a  3c],  [2a  26  3a],  [la  26  2c  3a],  [la  2c  3a  36], 
[la  2c  36  3c],  [la  2c  3c  3d],  [la  16  Ic  2c  3d  3e], 
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[Ic  Id  3e]  and  [Id  2a  3e].  The  homology  com¬ 
putations  returned  betti  numbers:  (3o  —  1  and 
[di  —  2.  This  agrees  with  having  a  single  con¬ 
nected  component  for  the  network  coverage  and 
two  objects  inside  the  coverage  of  the  cameras. 

In  figure  6  (right)  we  observe  similar  results  for 
a  configuration  that  can  be  interpreted  as  a  hall¬ 
way  in  a  building  floor.  There  is  a  single  bisect¬ 
ing  line  for  all  cameras.  Our  algebraic  analysis 
returns  /3o  =  1  and  j3i  —  1.  The  latter  identifies 
a  single  hole  corresponding  to  the  loop  formed 
by  the  hallway  structure.  The  list  of  maximal 
simplices  recovered  by  our  algorithm:  [3b  4a  46] , 
[26  3a  36],  [16  2a  26],  [la  16  46]. 

6  Experimentation 

In  order  to  demonstrate  how  the  mathematical 
tools  described  in  the  previous  sections  can  be 
applied  to  a  real  wireless  sensor  network,  we 
setup  an  experiment  tracking  a  robot  in  a  sim¬ 
ple  maze.  Figure  7  shows  the  layout  to  be  used. 
We  placed  a  sensor  network  consisting  of  CIT¬ 
RIC  camera  motes  [4]  at  several  locations  in  our 
maze  and  let  a  robot  navigate  through  the  en¬ 
vironment.  The  CWcomplex  is  constructed  for 
this  particular  coverage  and  used  for  tracking  in 
this  representation.  The  homologies  were  com¬ 
puted  using  the  PLEX  software  package  [1]. 

Time  synchronization  is  required  in  order  to 
determine  overlaps  between  the  different  camera 
regions.  This  is  accomplished  by  having  all  of 
the  camera  nodes  sharing  time  information  with 
one  another. 

The  model  is  constructed  by  having  each  cam¬ 
era  node  perform  local  computations.  Each  cam¬ 
era  node  first  looks  for  bisecting  lines  (as  shown 
in  figure  8)  in  order  to  decompose  its  coverage, 
and  the  detections  of  the  target  on  the  corre- 


Figure  7:  Layout  used  for  our  experiment.  Left: 
a  diagram  showing  the  location  of  the  different 
cameras;  Bottom  Right:  A  picture  of  our  exper¬ 
imental  maze.  Top  Right:  The  CITRIC  camera 
motes  used  for  our  experiments. 


spending  regions  are  stored  over  time.  In  our 
implementation,  occlusion  lines  are  detected  by 
looking  for  occlusion  events  over  time.  If  this 
event  did  not  correspond  to  an  occlusion  along 
the  boundary  of  the  image  domain,  then  we  es¬ 
timate  an  occlusion  line.  Note  that  the  informa¬ 
tion  extracted  from  each  camera  node  is  just  a 
decomposition  of  the  image  domain  with  a  list  of 
times  at  which  detections  were  made.  The  com¬ 
munication  requirements  are  minimal  due  to  this 
reduction  on  the  data. 

The  complex  is  built  by  combining  all  lo¬ 
cal  information  from  the  camera  nodes.  Each 
camera  node  transmits  the  history  of  its  de¬ 
tections  wirelessly  to  a  central  computer  that 
creates  the  CWcomplex.  The  resulting  com¬ 
plex  contains  the  maximal  simplices:  [la  16  46], 
[16  2a  26],  [26  3a  36],  [36  4a  46  56],  [36  56  6], 
and  [5a  56  6].  A  pictorial  representation  of  the 
complex  is  shown  in  figure  9  (right  plots). 
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Figure  8:  Left:  View  of  camera  5  from  the  layout 
in  figure  7  before  any  bisecting  lines  are  found. 
Right:  Same  view  after  a  bisecting  line  (shown 
in  black)  has  been  found. 


As  mentioned  earlier,  this  representation  can 
then  be  used  for  tracking  and  navigation  with¬ 
out  actual  metric  reconstruction  of  the  environ¬ 
ment.  Figure  9  shows  a  set  of  recorded  paths  for 
our  robot.  By  determining  which  simplices  are 
visited  by  the  robot’s  path  we  can  extract  a  path 
in  the  complex  as  shown  by  the  red  path  in  the 
complexes  of  figure  9.  The  main  advantage  of 
this  representation  is  that  the  path  in  the  com¬ 
plex  gives  a  global  view  of  the  trajectory  of  the 
robot,  while  local  information  can  be  extracted 
from  single  camera  views. 

It  is  possible  to  identify  paths  in  the  simpli- 
cial  representation  that  are  homotopic  (i.e.,  that 
can  be  continuously  deformed  into  one  another). 
The  tools  required  for  these  computations  are 
already  available  to  us  from  appendix  A.  In  par¬ 
ticular,  by  taking  two  paths  that  start  and  end 
at  the  same  locations  and  forming  a  loop,  we 
can  verify  that  they  are  homotopic  if  they  are 
the  boundary  of  some  combination  of  simplices. 
Equivalently,  since  a  closed  loop  a  is  just  a  col¬ 
lection  of  edges  in  Ci,  we  need  to  check  whether 
the  loop  a  is  in  Bi  (i.e.,  in  the  range  of  82) • 
This  is  just  a  simple  algebraic  computation.  By 
putting  the  top  and  middle  paths  from  figure  9 
together  we  note  that  the  resulting  loop  is  not 
in  the  range  of  82  (i-e.,  they  are  not  homotopic). 


Figure  9:  Several  paths  for  our  robot  in  the  maze 
(highlighted  in  red  on  the  left)  and  correspond¬ 
ing  mapping  to  the  (7V-complex  (highlighted  in 
red  on  the  right).  These  paths  can  be  easily 
compared  to  each  other  by  using  the  algebraic 
topological  tools  covered  in  appendix  A.  The 
problem  reduces  to  that  of  simple  linear  algebra 
computations. 


On  the  other  hand,  the  top  and  bottom  paths 
can  be  easily  checked  to  be  homotopic. 

Similarly,  for  coordinate-free  navigation  pur¬ 
poses,  this  representation  can  be  used  to  deter¬ 
mine  the  number  of  distinct  paths  from  one  loca¬ 
tion  to  another.  It  is  also  possible  to  find  paths  in 
the  CV-complex  and  use  local  information  from 
each  camera  to  generate  a  physical  path  in  the 
environment . 
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7  Summary  and  Comments 

An  algebraic  representation  of  a  camera  network 
coverage  is  obtained  through  the  use  of  discrete 
observations  from  each  camera  node.  The  math¬ 
ematical  tools  used  for  this  purpose  are  those 
of  algebraic  topology.  In  particular,  we  showed 
that  given  enough  observations  our  model  does 
capture  the  correct  topological  information. 

The  experiment  using  wireless  camera  nodes 
illustrates  how  this  simplicial  representation  of 
the  camera  coverage  can  be  used  to  track  and 
compare  paths  in  a  wireless  camera  network 
without  any  metric  calibration  information.  In 
particular,  these  results  can  also  be  extended  to 
coordinate-free  navigation,  where  our  represen¬ 
tation  can  give  an  overall  view  of  how  to  ar¬ 
rive  at  a  specific  location,  and  the  transitions 
between  simplicial  regions  can  be  accomplished 
in  the  physical  space  by  local  visual  feedback 
from  single  camera  views.  Using  this  proposed 
model  allows  for  local  processing  at  each  node 
and  minimal  wireless  communication.  A  list  of 
times  at  which  occlusion  events  were  observed 
is  all  that  needs  to  be  transmitted.  Also,  inte¬ 
ger  operations  are  all  that  are  required  to  per¬ 
form  the  algebraic  operations  described  in  this 
paper  [8] ,  which  opens  the  doors  to  potential  dis¬ 
tributed  implementation  on  platforms  with  low- 
computational  power. 
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A  Mathematical  Background 

In  this  section  we  cover  the  concepts  from  algebraic  topol¬ 
ogy  that  will  be  used  throughout  this  paper.  This  section 
contains  material  adapted  from  [7,  6]  and  it  is  not  in¬ 
tended  as  a  formal  introduction  to  the  topic.  For  a  proper 
introduction  to  the  topic,  the  reader  is  encouraged  to  read 
[13,  8,  7]. 

A.l  Simplicial  Homology 

Definition  6  Given  a  collection  of  vertices  V  we  define 
a  /c-simplex  as  a  set  [vi  V2  vs  ...  r’/c+i]  where  Vi  ^  V 
and  Vi  V  for  all  i  V  j  •  Also,  if  A  and  B  are  simplices 
and  the  vertices  of  B  form  a  subset  of  the  vertices  of  A, 
then  we  say  that  B  is  a  face  of  A. 

Definition  7  A  finite  collection  of  simplices  in  BT  is 
called  a  simplicial  complex  if  whenever  a  simplex  lies 
in  the  collection  then  so  does  each  of  its  faces. 

Definition  8  The  nerve  complex  of  a  collection  of  sets 
S  =  {Si}fLi,  for  some  N  >  0,  is  the  simplicial  com¬ 
plex  where  vertex  vi  corresponds  to  the  set  Si  and  its  k- 
simplices  correspond  to  non-empty  intersections  of  k  -\-  1 
distinct  elements  of  S. 

The  following  statements  define  some  algebraic  struc¬ 
tures  using  these  simplices. 

Definition  9  Let  {si}fLi  (for  some  N  >  0)  be  the  k- 
simplices  of  a  given  complex.  Then,  the  group  of  k- 
chains  Ck  is  the  free  abelian  group  generated  by  {si}. 
That  is, 

cr  G  Ck  iff  O'  =  diSi  +  O2S2  +  •  •  •  O-nSn 

for  some  G  Z.  If  there  are  no  k -simplices,  then  Ck 
0.  Similarly,  C-i  :=  0. 

Definition  10  Let  the  boundary  operator  dk  applied 
to  a  k-simplex  s,  where  s  =  [vi  V2  ■  ■  ■  Vk-\-i],  be  defined 
by: 

fc+i 

dkS  =  V2  “‘Vi-iVi^i  “‘VkVk+l], 

and  extended  to  any  0  ^  Ck  by  linearity. 
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A  k-chain  a  E  Ck  is  called  a  cycle  if  dkcr  =  0.  The 
set  of  k- cycles,  denoted  by  Z\^,  is  the  ker  dk  and  forms  a 
subgroup  of  Ck-  That  is, 

Zk  :=  ker  dk- 

A  k-chain  a  ^  Ck  is  called  a  boundary  if  there  exists 
p  G  Ck+i  such  that  dk-\-ip  =  cr.  The  set  of  k-boundaries, 
denoted  by  Bk;  is  the  image  of  dk+i  and  it  is  also  a  sub¬ 
group  ofCk-  That  is, 

Bk  :=  im  dk+i- 

Even  further,  we  can  check  that  dk{dk-\-icr)  =  0  for  any 
a  G  Ck+i,  which  implies  that  Bk  is  a  subgroup  of  Zk- 

Observe  that  the  boundary  operator  dk  maps  a  k- 
simplex  to  its  (k  —  l)-simplicial  faces.  Further,  the  set 
of  edges  that  form  a  closed  loop  are  exactly  what  we  de¬ 
note  by  the  group  of  1-cycles.  We  will  be  interested  in 
hnding  out  holes  in  our  domains;  that  is,  cycles  that  can¬ 
not  be  obtained  from  boundaries  of  simplices  in  a  given 
complex.  This  observation  motivates  the  dehnition  of  the 
homology  groups. 

Definition  11  The  k-th  homology  group  is  the  quo¬ 
tient  group 

Hk  ZkjBk- 

The  homology  of  a  complex  is  the  collection  of  all  ho¬ 
mology  groups.  The  rank  of  Hk,  denoted  the  k-th  betti 
number  jSk,  gives  us  a  coarse  measure  of  the  number  of 
holes.  In  particular,  Pq  is  the  number  of  connected  compo¬ 
nents  and  pi  is  the  number  of  loops  that  enclose  different 
“holes’’  in  the  complex. 

A.  2  Example 

In  hgure  10  we  observe  a  collection  of  triangular  shaped 
sets  labeled  from  1  to  5.  The  nerve  complex  is  obtained 
by  labeling  the  0-simplices  (i.e.,  the  vetices)  in  the  same 
way  as  the  sets.  The  1-simplices  (i.e.,  the  edges  in  the 
pictorial  representation)  correspond  to  pairwise  intersec¬ 
tion  between  the  regions.  The  2-simplex  correspond  to 
the  intersection  between  triangles  2,  4  and  5. 

For  the  group  of  0-chains  Co,  we  can  identify  the 
simplices  {[1],  [2],  [3],  [4],  [5]}  with  the  column  vectors 
{ui,  U2,  U3,  U4,  Us},  where  vi  =  [1,  0,  0,  0,  0]^  and  so  on. 

For  Cl,  we  identify  {[1  2],  [2  3],  [2  4],  [2  5],  [3  5],  [4  5]} 
with  the  column  vectors  {ei,  62,  63,  64,  65,  ee},  where  we 
dehne  ei  =  [1,  0,  0,  0,  0,  0]^  and  so  on. 


Figure  10:  On  the  left,  a  collection  of  sets  is  displayed. 
On  the  right,  a  pictorial  representation  of  the  correspond¬ 
ing  nerve  complex.  The  complex  is  formed  by  the  sim¬ 
plices:  [1],  [2],  [3],  [4],  [5],  [1  2],  [2  3],  [2  4],  [2  5],  [3  5], 
[4  5]  and  [2  4  5].  Note  that  the  actual  coordinates  of  the 
points  are  irrelevant  for  our  pictorial  representation. 


Similarly  for  C2,  we  identify  [2  4  5]  with  /i  =  1. 

As  we  mentioned  before,  dk  is  the  operator  that  maps 
a  simplex  a  G  C/e  to  its  boundary  faces.  For  example,  we 
have: 

d2[2  4  5]  =  [4  5]  -  [2  5]  +  [2  4]  iff  &/i  =  eg  -  64  +  63, 

di  [2  4]  =  [4]  —  [2]  iff  di e^  —  va  —  V2- 
That  is,  dk  can  be  expressed  in  matrix  form  as: 


0 

■  -1 

0 

0 

0 

0 

0 

n 

1 

-1 

-1 

-1 

0 

0 

u 

1 

0 

1 

0 

0 

-1 

0 

1 

-1 

0 

0 

1 

0 

0 

-1 

n 

0 

0 

0 

1 

1 

1 

u 

1 

Since  C-i  =  0, 

Hq  =  Zq!  Bq  =  ker  do  fim  di  —  im  di . 

We  can  verify  that 

pQ  —  dim^Hf)  —  1. 

Hence,  we  recover  the  fact  that  we  have  only  one  con¬ 
nected  component  in  the  diagram  of  hgure  10.  Similarly, 
we  can  verify  that 

Pi  —  dim{Hi)  —  dim{Zi/ Bi)  —  dim{ker  difim  ^2)  =  1, 

which  tells  us  that  the  number  of  holes  in  our  complex  is 
1.  Also,  Hk  =  0  for  k  >  1  (since  Ck  =  0). 

A. 3  Cech  Theorem 

Now  we  introduce  the  Cech  Theorem  which  has  been  used 
in  the  context  of  sensor  networks  with  unit-disk  coverage 
[6]  and  has  been  proved  in  [3] .  Before  we  proceed  further, 
we  will  require  the  following  dehnition: 
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Definition  12  Given  two  spaces  X  andY ,  a  homotopy 
between  two  continuous  functions  fo  :  X  ^  Y  and  f\  : 
X  ^Y  is  a  continuous  1 -parameter  family  of  continuous 
functions  ft’.X^Y  for  t  G  [0, 1]  connecting  fo  to  fi. 

Definition  13  Two  spaces  X  and  Y  are  said  to  be  of  the 
same  homotopy  type  if  there  exist  functions  f  :  X  ^  Y 
and  g  :  Y  ^  X  with  g  o  f  homotopic  to  the  identity  map 
on  X  and  fog  homotopic  to  the  identity  map  on  Y . 

Definition  14  A  set  X  is  contractible  if  the  identity 
map  on  X  is  homotopic  to  a  constant  map. 

In  other  words,  two  functions  are  homotopic  if  we  can 
continuously  deform  one  into  the  other.  Also,  a  space  is 
contractible  if  we  can  continuously  deform  it  to  a  single 
point.  It  is  known  that  homologies  are  an  invariant  of 
homotopy  type;  that  is,  two  spaces  with  the  same  homo¬ 
topy  type  will  have  the  same  homology  groups. 

Theorem  2  (Cech  Theorem)  If  the  sets  {Si}^i  (for 
some  A"  >  0^  and  all  nonempty  finite  intersections  are 
contractible,  then  the  union  IJ^i  homotopy 

type  of  the  nerve  complex. 

That  is,  given  that  the  required  conditions  are  satis¬ 
fied,  the  topological  structure  of  the  union  of  the  sets  is 
captured  by  the  nerve.  We  observe  that  in  figure  10  all 
of  the  intersections  are  contractible.  Therefore,  we  can 
conclude  that  the  extracted  nerve  complex  has  the  same 
homology  as  the  space  formed  by  the  union  of  the  trian¬ 
gular  regions. 

B  Proof  of  Theorem  1 

Throughout  this  section  we  consider  a  finite  set  of  cam¬ 
eras  indexed  by  a  G  {1,2,3- • -Ac}  with  corresponding 
domains  and  coverage  Ca.  Each  camera  coverage  is 
decomposed  by  all  possible  bisecting  lines  {Lq,,^}.  The 
collection  {Cq;,^}  is  the  result  of  this  decomposition,  where 
C(x,j  :=  CcK  n  Ka^j  and  is  the  convex  cone  resulting 

from  decomposing  the  plane  using  the  lines  {Lq,,^}  (see 
definition  4). 

Observation  3  It  may  be  useful  for  the  reader  to  think 
of  the  set  Cn  (the  visible  set  after  object  occlusions  have 
been  removed)  as  the  intersection  of  a  convex  set  (i.e.,  the 
camera  domain)  with  a  star  convex  set  (due  to  visibility 
from  Ooc). 


Observation  4  The  number  of  bisecting  lines  for  a  given 
camera  in  our  environment  is  finite  since  we  are  consider¬ 
ing  finite  number  of  object  in  the  coverage  with  piecewise 
linear  boundaries. 

Definition  15  The  line  segment  joining  points  p 
and  q  is  denoted  by  p~q.  The  line  passing  through 

points  p  and  q  is  denoted  by  L{p,q). 

Definition  16  The  triangle  formed  by  points  a,  b 

and  c  G  is  the  convex  hull  of  these  three  points  and  it 
is  denoted  Aa,b,c- 

Lemma  1  Given  that  o^^p  ^  then  oTp  G  Co,- 

Proof 

Since,  o^  and  p  G  Ca  C  then  Oo,  p  C  Pa  due 
to  convexity  of  Pa-  Let  r  G  oTp-  If  r  is  not  visible 
then  oYh  H  |J  Pi  ^  0  (where  {Pi}  is  the  collection  of 
objects  in  the  environment).  However,  this  implies  that 
oTpH  U  Pi  7^  0.  Hence,  we  conclude  that  p  is  not  visible, 
which  is  a  contradiction.  Therefore,  r  must  be  visible. 
Since  r  was  arbitrary  then  ojjp  is  visible.  □ 

Lemma  2  Given  that  p,q  ^  with 
L(p,Oa)  =  L(^,Oa), 

then  p~Q  ^  Ca .  That  is,  if  p  and  q  are  visible  and  are  in 
the  same  line  of  sight,  then  the  line  joining  them  is  visible 
too. 

Proof 

This  follows  from  the  definition  of  Ca  and  the  domain 
of  a  camera  Pa  -  We  know  that  Pa  is  convex,  so  p~q  C  Pa 
since  p,q  E  Co,  C  Vo,. 

From  our  assumption  L(p,  Oa)  =  L(g,  Oa),  it  is  possible 
to  conclude  that  for  r  G  then  r  G  Pa ,  and  r  G  oTp  or 
r  G  Oa  q.  Basically,  there  is  only  two  cases,  both  p  and  q 
on  the  same  side  of  Oa  or  on  opposites  sides.  Either  way, 
r  must  be  in  oTp  or  oXq- 

Without  lost  of  generality,  assume  r  G  oTp-  If  r  was 
not  visible,  the 

Oa  r  n  [J  Pi  /  0 

(where  {Pi}  is  the  collection  of  sets  representing  the  ob¬ 
jects  in  the  space).  This  implies  that 

oTp  n  [J  Pi  /  0, 

since  Oa  r  C  oTp-  This  implies  that  p  ^  Co,  which  is  a 
contradiction.  Therefore,  r  must  be  visible  too.  □ 
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Figure  11:  Steps  on  the  construction  of  a  monotone  convex  path  for  lemma  6. 


Lemma  3  Given  a  closed  path  r([0, 1])  C  Ca,  then  the 
space  enclosed  by  F  is  also  in  Ca  ■ 

Proof 

Let  IZ  be  the  enclosed  area  by  the  path  F.  Since  F  : 
[0, 1]  ^  is  bounded,  then 

3M  >  0  such  that  ||r(t)  —  Oa||  <  M, 

where  Oa  is  the  location  of  camera  a.  Hence, 

r  ^  7^  if  ||r  —  Oa||  >  M. 

Also,  if  a  point  r'  is  connect  to  r  ^  7^  through  a  path  7 
that  does  not  cross  L,  then  r'  ^IZ. 

Let  p  ^IZ  and  define 

C  :=  L(p,  Oa)  n  r([o,  1]) 

(i.e.,  points  in  L  and  in  the  line  passing  through  p  and 
Oo;),  then  there  must  be  points  ^1,^2  G  C  such  that  and 
p  G  qi  q2-  Otherwise,  there  would  exist  a  point  r  G  L  with 
||r  — OckII  >  M  (i.e.,  r  ^  7Z)  such  that  Vp  does  not  intersect 
r([0, 1]).  This  implies  p  ^  IZ  which  is  a  contradiction. 
Therefore,  p  G  qi  q2- 

Next,  we  consider  three  cases: 

•  Assume  qi  ^  o^  and  ^2  7^  On-  Since  ^1,^2  G 
r([0, 1])  C  Cn  with  L{qi,Oa)  =  L(^2,Oa),  then 
p  G  qi  q2  C  Cn  by  lemma  2  (which  makes  p  visi¬ 
ble). 

•  Assume  qi  7^  Oa  and  ^2  =  Oa-  Then  p  G  Oaq2  C  Ca 
by  lemma  1. 

•  Assume  =  o^.  Then,  p  =  Oa  ^Ca¬ 

in  all  cases  p  is  visible,  and  since  p  was  arbitrary  we 

conclude  tha  IZ  is  visible.  □ 

The  previous  lemmas  are  also  true  if  we  replace  Ca  by 
the  set  Ca,j  resulting  from  a  decomposition  of  the  cover¬ 
age.  The  reason  why  it  works  is  because  we  can  think  of 
Ca,j  as  being  the  coverage  of  a  camera  with  a  domain 

Va^j  l^VaHKaJ, 


where  Ka,j  is  the  corresponding  convex  cone  that  gen¬ 
erates  the  region  Ca,j-  This  new  domain  is  still  convex 
which  is  the  property  used  in  the  previous  lemmas.  How¬ 
ever,  note  that  this  T>a,j  is  not  open. 

Lemma  4  Every  connected  component  0/ 

where  A  is  a  finite  set  of  indices,  is  simply  connected. 

Proof  Let  T  be  a  closed  loop  in  0(0;  j)GA 

previous  lemma,  the  space  enclosed  by  T  is  inside  Ca,j  for 

all  (a,j)  G  A.  □ 

Definition  17  Let  T  :  [0, 1]  ^  R^  be  a  path  connecting 
points  p  to  q  (i.e.  r(0)  =  p  and  r(l)  =  q).  We  define 
the  region  enclosed  by  T,  denoted  by  LZ(F),  to  be  the 
region  enclosed  by  the  set  r([0, 1])  Gfiq. 

Definition  18  A  path  F  :  [0, 1]  ^  R^  connecting  points 
p  and  q  is  said  to  be  a  convex  path  z/7^(r)  is  convex. 

Definition  19  A  non-intersecting  path  F  :  [0, 1]  ^  R^ 
is  monotone  with  respect  to  camera  a  if  for  any 

p  G  where  is  the  unit  circle  centered  at  Oa,  we  have 
that  r([0, 1])  nL(p,  Oq;)  has  a  single  connected  component. 

Lemma  5  Let  IZ  be  a  bounded  convex  set  contained  be¬ 
tween  the  lines  L(p,  Oa)  and  L(q,  Oa),  where  p  and  q  ElZ. 
Then,  either  p~q  is  the  only  path  in  IZ  joining  p  to  q,  or 
there  are  exactly  two  distinct  images  of  monotone  paths 
connecting  p  to  q  (only  intersecting  at  the  end  points), 
which  form  the  boundary  oflZ. 

The  hgure  above  illustrates  the  results  from  the  previ¬ 
ous  lemma. 

Lemma  6  Given  that  Ca  is  connected  withp  and  q  E  Ca, 
then  there  exists  a  path  F  connecting  these  points  that 
is  convex  and  monotone  with  respect  to  camera  a  with 

r([0,  1])  CCaH  Ap.q,o„. 
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Proof 

We  present  an  outline  of  the  proof  of  this  result. 

Let  p  and  q  where  is  connected. 

The  reader  may  be  tempted  to  try  the  path  p"d^UdT^. 
However,  we  are  not  assuming  Our  proof  takes 

care  of  this  case  too. 

Since  is  connected  then  there  exists  a  path  To  that 
connects  p  lo  q  with  ro([0, 1])  C  Ca.  We  illustrate  this 
in  the  diagram  in  figure  11  (a)  in  which  the  gray  region 
corresponds  to  the  coverage  under  consideration. 

Our  first  objective  will  be  construct  a  path  that  is  con¬ 
tained  within  Ap^q^oa  • 

We  start  with  path  To  and  consider  the  line  L(p,Oa) 
(see  figure  11  (a)  ).  This  line  will  intersect  the  To  at 
points  {rfe}.  By  lemma  2,  we  know  that  the  line  segments 
between  them  are  visible,  so  we  can  construct  path  Ti  (as 
shown  in  figure  11  (b))  which  does  not  cross  L(p,  Oa). 

Next,  we  consider  the  intersections  between  L(q,Oa) 
and  Ti  (see  figure  11  (b)  ).  As  in  the  previous  case,  we 
can  built  a  path  r2  which  does  not  cross  L(g,  Oq,). 

If  we  consider  the  line  L{p,q),  then  it  will  intersect 
the  line  r2  at  points  {rk}  (see  figure  11  (c)  ).  Consider 
a  segment  of  r2  that  is  outside  of  the  triangle  Ap^q^oa^ 
which  intersects  L{p,q)  at  ri  and  r2.  For  any  r  G  ri  r2, 
we  see  that  r  G  T>cx  since  rk  G  T>a  and  T>a  is  convex.  Also, 
there  exists  of  a  point  r'  G  L(r,Oa)  H  r2([0, 1])  which  is 
further  away  from  Oa  than  r.  Otherwise,  the  line  segment 
in  r  between  ri  and  r2  would  not  be  outside  the  Ap^q^oa- 
Therefore,  if  r  was  not  visible  then  r  would  not  be  visible 
which  is  a  contradiction.  Hence,  r  must  be  visible. 

This  implies  that  we  can  connect  ri  to  r2  by  the 
line  segment  ri  r2  and  construct  path  Ta  which  is  inside 

Ap^q^Ocx  • 

In  order  to  make  Ta  into  a  convex  path,  we  take  the 
convex  hull  of  Ta  and  by  lemma  5  we  know  that  there  are 
at  most  two  monotone  paths  to  choose  from  (see  figure  1 1 
(d)).  We  choose  the  path  T  that  is  closest  to  Oq,.  Clearly 
r  is  convex.  We  can  see  that  T  is  visible  since  for  any  line 
L(r,  Oa)  for  r  G  ra([0, 1]),  the  line  will  have  to  intersect  T 
at  some  location  s  closer  to  than  r. 


This  process  yields  the  desired  monotone  and  convex 
path  r  (see  figure  11  (e))  which  images  is  in  Ca  H  Ap,oa,g- 
□ 

Lemma  7  Given  that  is  connected  with  p  and  q  G 
Ccxj  for  some  j ,  then  there  exists  a  path  T  connecting 
these  points  that  is  convex  and  monotone  with  respect  to 
camera  a  with  r([0, 1])  C  Caj  H  Ap^q^oa  ■ 

Proof  Since  p  and  q  G  then  p  and  g  G  fl 
By  the  previous  lemma,  we  know  that  there  exists  a  path 
r  such  that  r([0,l])  C  Cq,.  Note  that  r([0, 1])  is  in¬ 
side  the  cone  formed  by  the  lines  L{p,Oa)  and  L{q,Oa) 
by  construction.  This  cone  must  be  contained  within 
otherwise  p  and  q  could  not  be  in  There¬ 

fore,  r([0,  1])  C  Ka^j  GCot  —  Coc,j.  □ 

Lemma  8  Let  V  :  [0, 1]  ^  ^  he  a  feasible  monotone 
path  connecting  p  and  q  ^  Ca.  with  r([0, 1])  C  T>a  for  some 
camera  a.  If  an  object  O  is  within  the  region  enclosed 
by  oTp  U  r([0, 1])  U  TdT  then  there  exist  a  bisecting  line 
L  passing  through  a  point  in  T  that  does  not  intersect 
L(p,Oa)  and  L(q,Oa)  (not  including  these  lines). 

Proof 

For  simplicity  we  just  give  an  outline  to  this  proof. 

Since  F([0, 1])  C  we  know  that  no  point  in  F  will 
be  in  the  boundary  of  T>a  since  T>a  is  open. 

Since  ojjp  U  F([0, 1])  U  encloses  an  object,  there 
exists  a  transition  between  having  a  visible  and  a  not- 
visible  point  in  the  path  (i.e.  an  occlusion  event).  This 
is  guaranteed  since  at  least  a  point  in  the  path  is  visible, 
and  not  all  the  points  can  be  visible  due  to  the  object  O. 

Assume  that  the  transition  event  occurs  in  L(p,Ocx) 
or  L{q,  Oa)  at  some  point  r  G  F([0, 1])  and  no  where  else. 
Without  lost  of  generality  assume  that  rp  C  F([0, 1])  (due 
to  monotonicity  of  path).  The  object  would  have  to  oc¬ 
clude  r  too  (since  objects  are  closed).  Then  either  the 
path  is  not  feasible  or  p  is  not  visible  which  contradicts 
our  assumption.  Therefore,  a  transition  must  occur  in 
some  other  point  along  F  and  not  in  these  lines.  □ 

Theorem  3  (Decomposition  Theorem)  Let  {Ca}a=i 
be  a  collection  of  camera  coverage  where  each  Ca  is  con¬ 
nected  and  N  is  the  number  of  cameras  in  the  domain. 
Let  {Ca,k}{(x,k)eAD  collection  of  decomposed  sets 

by  all  possible  bisecting  lines,  where  Ad  is  the  set  of  in¬ 
dices  in  the  decomposition.  Then,  any  finite  intersection 
0(0;'  k')eA^c^'  LA  where  A  is  a  finite  set  of  indices,  is  con¬ 
tractible. 
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Proof  For  simplicity  we  just  give  an  outline  to  this  proof 
for  two  cameras.  The  proof  for  multiple  cameras  can  be 
completed  by  induction. 

Let  p  and  q  G  Cai,/ci  H  0^2 M  foi*  some  indices  (ai,  ki). 

Part  I: 

First,  consider  cameras  ai  and  02  in  the  same  side  of 
the  line  L(p,q).  We  know  that  there  exist  convex  mono¬ 
tone  paths  Fi  connecting  p  to  q  such  that  Fi([0, 1])  C 
Cai,ki  n  Ap^ooc-,q  for  z  =  1,2  (see  left  plot  in  figure  12). 


Figure  13:  Illustrations  for  Case  1. 


P  q  p  q 


Figure  12:  Illustration  of  the  construction  of  F. 

By  lemma  5,  we  can  choose  a  path  F  corresponding  to 
a  segment  of  the  boundary  of  7^  :=  7^(Fi)  n  7^(F2)  (see 
right  plot  in  figure  12).  We  choose  the  path  that  consists 
of  segments  from  Fi  and  F2  so  F  will  be  feasible.  We 
note  that  lemma  5  also  tells  us  that  F  is  monotone  with 
respect  to  camera  ai  (since  71  is  between  L(p,Oa.)  and 
L(g,OcJ).  Also, 

F  c  =  7^(Fl)  n  7^(F2)  C  Pai  n  Pa2 
due  to  convexity  of  7)^^ . 

By  lemma  8,  we  know  that  there  are  no  objects  in¬ 
side  the  regions  enclosed  by  poa^  U  F([0, 1])  U  qo^^  (since 
otherwise  there  would  be  a  bisecting  line  and  we  as¬ 
sumed  that  we  already  decomposed  using  all  bisecting 
lines).  Hence,  s  Oq,.  does  not  intersect  any  object  for 
s  G  F([0, 1]),  which  implies  that  F  is  visible  by  both  cam¬ 
eras  (i.e.  F([0, 1])  FL  Ccxijki  F\Ccx2-,k2^' 

Part  II: 

Now  we  consider  cameras  ai  and  a2  at  opposite  sides 
of  the  line  L(p,  q).  There  are  two  main  cases  to  consider. 
Case  1: 

For  the  first  case  we  consider  a  configuration  as  seen 
in  figure  13  (left). 

We  would  like  to  conclude  that  the  path  p~q  is  visible 
by  both  cameras.  Assume,  it  is  only  visible  up  to  a  point  r 
(not  including  this  point  since  objects  are  closed).  Then, 
an  object  must  intersect  roW  oi*  r  lemma  8, 

we  notice  that  the  interior  of  Ar,oc.,q  must  be  empty; 
otherwise,  there  would  be  a  bisecting  line. 


Also,  since  1)^2  is  open,  we  can  find  a  ball  B  around 
q  that  is  contained  in  (see  right  plot  in  figure 

13 (right)).  We  choose  a  point  s  G  B  H  q~o7C- 

Then,  Ap^s.q  C  nPa2-  From  there,  we  can  choose 
a  point  s'  as  shown  in  the  diagram  such  that  s'  q  C  77^2 
is  feasible.  This  is  possible  since  there  are  no  objects 
in  Ar,ooc-y,q-  However,  if  there  was  an  object  intersecting 
r  0^2,  then  L(r,  00,2)  would  be  a  bisecting  line.  But,  it  is 
not.  So,  there  is  no  objects  interseting  r~o7C- 

Similarly,  there  are  no  objects  intersecting  roW-  This 
means  that  r  is  visible  by  both  cameras,  which  contradicts 
our  initial  assumption.  This  shows  that  p~q  is  visible  by 
both  cameras,  i.e.  p~q  C  CCot2M- 

Case  2: 


Nevertheless, 


For  the  second  case,  we  con¬ 
sider  a  configuration  as  shown  to 
the  left. 

By  following  the  same  analy¬ 
sis  as  before,  we  can  show  that 
pr  —  {r}  and  r~q  —  {r}  must  be 
visible  by  both  cameras.  How¬ 
ever,  we  could  have  an  object  in 
objects  must  enclose  some  area 


^0.1  '-'OL2  • 

which  does  not  allow  an  object  to  be  contained  in  this 
line.  Therefore,  p~q  C  Cai,/ci  F\Cci2,k2-  ^ 
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